Skew Detection/Correction and Local Minima/Maxima Techniques for Extracting a New Arabic Benchmark Database

نویسنده

  • Husam Ahmed Al Hamad
چکیده

We propose a set of techniques for extracting a new standard benchmark database for Arabic handwritten scripts. Thresholding, filtering, and skew detection/correction techniques are developed as a pre-processing step of the database. Local minima and maxima using horizontal and vertical histogram are implemented for extracting the script elements of the database. Elements of the database contain pages, paragraphs, lines, and characters. The database divides into two major parts. The first part represents the original elements without modifications; the second part represents the elements after applying the proposed techniques. The final database has collected, extracted, validated, and saved. All techniques are tested for extracting and validating the elements. In this respect, ACDAR proposes a first issue of the Arabic benchmark databases. In addition, the paper confirms establishment a specialized research-oriented center refers to learning, teaching, and collaboration activities. This center is called "Arabic Center for Document Analysis and Recognition (ACDAR)" which is similar to other centers developed for other languages such as English. Keywords—ACDAR; Arabic benchmark database; Arabic scripts; document analysis; handwriting recognition; skew detection and correction

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A new Automatic Formant Tracking approach based on scalogram maxima detection using complex wavelets

In this paper we present a new formant tracking algorithm where the formant frequencies estimation was based on local maxima detection of a time frequency representation. This representation can be shown by a scalogram issued from a complex wavelet transform. The formant frequency candidates are validated as local maxima of scalogram which correspond to wavelet ridges. Then in the proposed algo...

متن کامل

New Pseudo-CT Generation Approach from Magnetic Resonance Imaging using a Local Texture Descriptor

Background: One of the challenges of PET/MRI combined systems is to derive an attenuation map to correct the PET image. For that, the pseudo-CT image could be used to correct the attenuation. Until now, most existing scientific researches construct this pseudo-CT image using the registration techniques. However, these techniques suffer from the local minima of the non-rigid deformation energy f...

متن کامل

Staff Line Detection by Skewed Projection

Most optical music recognition systems start image analysis by the detection of staff lines. This work explores simple techniques from document image analysis, such as line segment extraction, to guide the staff line identification process. Specifically, the overall document skew is computed from the detected line segments. Staff lines are then projected in the direction of the detected skew an...

متن کامل

New Preprocessing Techniques for Handwritten Word Recognition

The research described in this paper focuses on the presentation of two novel preprocessing techniques for the task of off-line handwritten word recognition. A technique for the identification of straight and skewed underline noise is described along with a novel algorithm for detecting skew in handwritten words. The latter identifies skew by detecting the center of mass in each half of a word ...

متن کامل

New Preprocessing Techniques for Handwritten Word Recognition

The research described in this paper focuses on the presentation of two novel preprocessing techniques for the task of off-line handwritten word recognition. A technique for the identification of straight and skewed underline noise is described along with a novel algorithm for detecting skew in handwritten words. The latter identifies skew by detecting the center of mass in each half of a word ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015